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Abstract — This paper introduces a robust point-to-point trans- 
mission scheme: Tetrys, that relies on a novel on-the-fly erasure 
coding concept which reduces the delay for recovering lost data 
at the receiver side. In current erasure coding schemes, the 
packets that are not rebuilt at the receiver side are either 
lost or delayed by at least one RTT before transmission to 
the application. The present contribution aims at demonstrating 
that Tetrys coding scheme can fill the gap between real-time 
applications requirements and full reliability. Indeed, we show 
that in several cases, Tetrys can recover lost packets below one 
RTT over lossy and best-effort networks. We also show that 
Tetrys allows to enable full reliability without delay compromise 
and as a result: significantly improves the performance of time 
constrained applications. For instance, our evaluations present 
that video-conferencing applications obtain a PSNR gain up to 
7dB compared to classic block-based erasure codes. 

I. Introduction 

Multimedia applications, even over best effort networks, 
are more and more pervasive today. This is the sign of 
an important need by end-users for such applications, no 
matter their location and the connection technology being 
used. If the networking conditions are sometimes appropriate, 
users might also experience long transmission delays and 
significant packet losses. When this happens, providing the 
level of data delivery timeliness and reliability required by 
multimedia applications seems to be really challenging ||7|. 
In this context, this work aims at providing a transport-level 
reliability mechanism, called Tetrys, compliant with real-time 
applications requirements and able to recover lost packets in 
a given time threshold. 

Currently there are two kinds of reliability mechanisms 
based respectively on retransmission and redundancy schemes. 
Automatic Repeat reQuest (ARQ) schemes recover all lost 
packets thanks to retransmissions. This implies that the re- 
covery delay of a lost packet needs at least to wait one 
supplementary Round Trip Time (RTT). However, this can 
be problematic if this delay exceeds the threshold of the 
application (i.e. the threshold above the application considers 
a packet outdated). 

A well-known solution to prevent this additional delay is 
to add redundancy packets to the data flow. This can be done 
with the use of Application Level Forward Error Correction 
(AL-FEC) codefl The addition of n — fc repair packets to a 

'AL-FEC codes ai'e FEC codes for the erasure channel where symbols 
(i.e. packets) are either received without any error or lost (i.e. erased) during 
transmission. 



block of k source packets allows to rebuild all of the k source 
packets if a maximum of n — k packets are lost among the n 
packets sent. In practice, only Maximum-Distance Separable 
codes (MDS), such as Reed-Solomon codes [13j|, have this 
optimal property, whereas other families of codes (like LDPC 
ifTs ! or Raptor codes ||20|) need to receive a few number of 
extra symbols in addition to the k strict minimum. However, 
if more than {n — k) losses occur within a block, decoding 
becomes impossible. In order to increase robustness (e.g. to 
tolerate longer bursts of losses), the sender can choose to 
increase the block size (i.e. the n parameter) with the price 
of an increase of the decoding delay in case of erasure. In 
order to improve robustness while keeping a fixed delay, 
the sender can also choose to add more redundancy while 
keeping the same block size with the price of a decrease 
of the goodput (which is not necessarily affordable by the 
application). These trade-off between (packet decoding delay, 
block length) and throughput are, for instance, addressed in 
|11|. Another approach is proposed in lfT4l where the authors 
use non-binary convolutional-based codes. They show that 
the decoding delay can be reduced with the use of a sliding 
window, instead of a block of source data packets, to generate 
the repair packets. However, both mechanisms do not integrate 
the receivers' feedbacks and thus, cannot provide any full 
reliability service. 

Finally, an hybrid solution named Hybrid- ARQ which com- 
bines ARQ and AL-FEC schemes is often used. This is an 
interesting solution to improve these various trade-off |fT9l . 
However, when retransmission is needed, the application-to- 
application delay still depends on the RTT which might be not 
acceptable with real-time applications. 

The present contribution totally departs from the above 
schemes. In fact it inherits from the following two independent 
works on erasure and network coding which have converged 
to an on-the-fly coding mechanism where feedbacks from the 
receivers are considered during the encoding process; 

• In ETI . Sundararajan et al. have proposed a coding 
scheme which includes feedback messages on the reverse 
path. The goal of this feedback path is to decrease the 
encoding complexity at the sender side without impacting 
on the communication transfer. This scheme allows to 
reduce the number of transmissions and as a result, 
the average decoding delay in the context of multiple 
receivers. In their evaluation, the authors neglect trans- 
mission delays and the resulting delays from the losses 
observed by different receivers. A noticeable contribution 
of their work is the concept of seen packet by which 
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the receiver acknowledges the degrees of freedom of 
the linear system corresponding to the received packets. 
This scheme has the main benefit of optimizing buffers 
occupancy while reducing the encoding complexity; 
• Independently, Lacan and Lochin also proposed in ifTSl an 
on-the-fly coding system using feedbacks in the context 
of point-to-point communications with high transmission 
delays. Basically, the principle is to add repair packets 
generated as a linear combination of all the source data 
packets sent but not yet acknowledged. This scheme 
was proposed in order to enable full-reliability in Delay 
Tolerant Networks (DTN) and more specifically in Deep 
Space Networks (DSN) where an acknowledgment path 
might not exist and where the experienced delay might 
prevent the efficient use of standard ARQ schemes. 
Unlike current reliability methods, these on-the-fly coding 
schemes allow to fill the gap between systems without re- 
transmission and fully reliable systems by means of retrans- 
missions. In our work, we propose to deeply investigate the 
recovery delay of the lost packets, which is one essential 
characteristic of these on-the-fly coding schemes, and we show 
that this delay is both tunable and independent of the RTT. The 
main contributions of this paper are the application of Tetrys 
lfT2l (augmented with the concept of seen packets 1211 ) to 
the context of real-time applications and the analysis of the 
performances achieved. 

We present the Tetrys mechanism in Section HI] and illustrate 
the simplicity of its configuration compared to FEC codes in 
Section [III] Then we demonstrate in Section |IV] that Tetrys 
offers significant gains compared to standard erasure coding 
schemes, in particular in terms of delay versus reliability 
trade-off in the context of video-conferencing. An exhaustive 
analytical study of the mechanism is given in Section [V] It 
is followed by a performance analysis in Section |Vl] that 
complements the experiments of Section |IV] and demonstrates 
that Tetrys is able to determine the minimal amount of 
redundancy required to fulfill the application requirements. We 
finally conclude this work in Section IVIII 

II. Proposal description 

This section describes the Tetrys mechanism and the integra- 
tion of the seen packet concept ^T\. We choose to introduce 
the main Tetrys principle in Section III- Al to allow the reader a 
quick understanding of the present coding scheme used while 
Section Hl-BI further details Tetrys internal mechanisms. 

A. Tetrys in a nutshell 

Let us start with a quick overview of Tetrys. The Tetrys 
sender uses an elastic encoding window buffer (denoted 
BS) which includes all the source packets sent and not yet 
acknowledged. Let Pi be the source packet with sequence 
number i. Every k source packets, the sender sends a (single) 
repair packet R{i..j), which is built as a linear combination 
(with random coefficients) of all the packets currently in 
BS. The receiver is expected to periodically acknowledge the 
received or decoded packets. Each time the sender receives an 
acknowledgment, it removes the acknowledged packets from 



Pi 


The i"* source packet sent 




A repair packet built as a linear combination of the 
source packets i to j: R(i..j) = Xli-i "^fe "''^fc 


k 


The number of source packets between the 
transmission of two repair packets 


n 


The total number of source plus repair packets for 
each group of k source packets is denoted n (to keep 
the same notation than FEC) and is always equal to 
k + 1 for Tetrys 


R 


The redundancy ratio: R = {n — k) /n = 1 — {k/n) 
where (k/n) is the code rate 


Ar 


The difference between the redundancy ratio and the 
packet loss rate: Ar = R — p 


P 


The packet loss rate (PLR) experienced 


L 



The average burst size in case of a Gilbert Elliot 
channel. This value equals to 1 in the particular case 
of a Bernoulli channel. Therefore this parameter also 
defines the type of erasure channel used 


p 


The i'" lost packet 


Fsack 


The feedback (i.e. acknowledgment) transmission 
frequency, at the receiver 


PS 


The sender's (elastic) encoding window, composed of 
source packets not yet acknowledged 


PR 


The receiver's buffer where the packets received and 
decoded are kept until they are no longer needed to 
decode 



TABLE I 
Notations. 



BS. A receiver can decode lost packets as soon as the rank 
of the linear system, which corresponds to the available repair 
packets, is higher or equal to the number of lost packets. In 
most cases, the decoding is successful as soon as the number 
of lost packets is lower of equal to the number of repair packets 
received. 

It results that: (1) Tetrys is tolerant to any burst of source, 
repair or acknowledgement losses, as long as the amount of 
redundancy exceeds the packet loss rate (PLR), and (2) the lost 
packets are recovered within a delay that does not depend on 
the RTT, which is a key property for real-time applications. 
These properties will be thoroughly studied in the remaining 
of this paper. 
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R(1,2) 



R(1..6) 



R(1..6) R(1..8) 



Fig. 1. A simple data exchange with Tetrys (k=2). 
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1) A simple data exchange: Figure [T| illustrates a simple 
Tetrys exchange. Here fc = 2 which means that a repair packet 
is sent each time two source packets have been sent. The right 
side of this figure shows the list of packets that are lost and not 
yet rebuilt, as well as the repair packets kept by the receiver in 
order to recover them. During this data exchange, packet P2 is 
lost. However, the repair packet i?(i,2) successfully arrives and 
allows to rebuild P2- The receiver sends an acknowledgement 
for packets Pi and P2, in order to inform the sender that it can 
compute the next repair packets from packet P3 . Unfortunately 
this acknowledgement is lost. However this loss does not 
compromise the following transmissions and the sender simply 
continues to compute repair packets from Pi. After this, we 
see that P3, P4 and -R(i..4) packets are also lost. These packets 
are rebuilt thanks to i?(i..6) and i?(i..8) since the number 
of repair packets becomes higher or equal to the number of 
losses. 



B. A broader view of Tetrys 

We now detail the key concepts of Tetrys, namely the 
encoding and decoding process, the notion of seen packet, 
and the use of acknowledgments. 

7 ) Encoding process: A repair packet is sent every k source 
packets. This packet is computed as a linear combination of 
all the source packets currently in BS, as follows: 



R 



■Pi 



(i 1) 

where all packets between Pi and Pj belong to BS, with al 
are coefficients randomly chosen in a finite field F^, and where 
the multiplication of a coefficient by a packet is defined in 
ifTTl . From a practical point of view, instead of transmitting all 
the coefficients along with the associated repair packet (which 
introduces a potentially large transmission overhead), we use 
a Pseudo-Random Number Generator (or PRNG, e.g. ||6l) and 
only transmit the seed which has been used. 

The k value is directly related to the code rate which is equal 
to This is of course a key parameter that should ideally 
be adjusted dynamically depending on the network conditions. 
For the sake of simplicity, the code rate is chosen fixed. In 
section [V^ we analytically detail the code rate and evaluate 
with simulations its impact on the overall performance. We 
finally provide some guidelines to correctly set this value in 
Section IVll 

2) Decoding process: Decoding (i.e. recovering lost source 
packets) consists in solving the system of linear equations 
currently available at the receiver side. The available source 
packets (received or decoded) are stored by the receiver as long 
as they might be used by the source to build the next repair 
packets R(i..j) while the repair packets are also stored as long 
as they can be used to recover lost packets. More precisely, 
when a new repair packet arrives, all the available 

source packets that are part of Pi .. Pj are subtracted from 
R{i..j)- The result is R{Li..Li)^ where (L1..L1) G {Pi..Pj) is 
the subset of packets of the linear combination that have been 
lost. 



Let us assume that the / source packets {L1..L1) have been 
lost and that I repair packets have been received and stored in 
BR. Let i?* be the i*'' packet of the set of I repair packets (for 
the sake of readability, this notation does not mention the set 
of source packets used by the linear combination). We obtain: 



with: 



(1) 



lost 



and where a£ is the coefficient used to encode the j*'' 
packet in i?*. If G can be inverted, the lost packets (Li. 
are recovered with: 

{Li,..,Uf = G-'-{R\..,RY 

Once the decoding is successful, all of these / repair packets 
can now be removed from BR. If the matrix G is singular, 
the repair packet whose coefficients are linearly dependent is 
discarded, and the receiver has to wait one more repair packet 
to do another attempt. 

A solution to improve the probability of having an invert- 
ible matrix could consist in using super-regular matrices lH). 
However the dynamic nature of Tetrys makes this solution 
complex to set up. Furthermore, it can be observed that with 
random coefficients, G has an extremely high probability of 
being invertible if the finite field is chosen sufficiently large 

3) Seen packet: A lost packet is considered as "seen" by 
a receiver when it receives a fresh repair packet built from a 
linear combination that includes this lost packet (i.e. the lost 
packet was part of the BS at the time the repair packet has been 
created). Even if a seen packet cannot be decoded immediately, 
the received repair packet contains enough information to 
recover this packet later. This explains why a "seen" packet 
acknowledges a source data packet as if it has been effectively 
received. Of course, when several lost packets are covered by 
one repair packet, only the oldest lost packet is considered as 
seen. 

4) Acknowledgment packet: A receiver periodically sends 
acknowledgment packets. Each acknowledgment contains the 
list (in the form of a SACK vector ifTSl ) of the packets seen or 
effectively received or decoded. Upon receiving this acknowl- 
edgment, the sender removes the acknowledged packets from 
the encoding window (BS). Therefore these packets are no 
longer included in the linear combinations used to encode the 
next repair packets II2TI . This reduces the encoding/decoding 
complexity. We choose to set the acknowledgment transmis- 
sion frequency Fsack, as a function of the current RTT: 
FsACK = s X RTT where typical values for s are ranging 
from 0.25 to 2. While the choice of Fsack does not impact 
on the reliability of the mechanism, there is a trade-off 
to find between the increase of Fsack which reduces the 
encoding/decoding complexity (evaluated in Section [V-Db and 
the transmission overhead and acknowledgement processing 
cost. 
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Sender's buffer 
PI 
P2 PI 

P3 P2 PI 
P4 P3 P2 PI 

P5 P4 P3 P2 
P6 P5 P4 P3 P2 

P7 P6 P3 
PS P7 P6 P3 

P9 PS P7 P6 P3 
P10P9 

P11 
P12P11 




Sack(1) 



T Sack(2,4,5) 



Sack(2..S) 



Sack(9,10) 



Receiver's buffer 



R(1,2) 

R(1,2) P4 

R(1,2) R(2-4) P4 

R(1,2) R(2-4) P4 P5 

R(1,2) R(2-4) P4 P5 P6 

R(1,2) R(2..4) R(2..6) P1..3 P4..I 

P2 P3 P4 P5 P6 P7 

P2 P3 P4 P5 P6 P7 PS 

P3 P6 P7 PS P9 

P3 P6 P7 PS P9 P10 

P9 P10 P11 
P9 P10 P11 
R(11,12) P11 P12 



Fig. 2. A more elaborate data exchange, with selective acknowledgements 
and seen packets (k=2). 



5) A complete example: Let us consider the example of 
Figure |2] where we assume the receiver sends back ac- 
knowledgments to a fixed frequency Fsack- The sender first 
transmits packets Pi, P2 and ^(1,2)- Since the repair packet 
i?(i_2) is the only one to be received, the receiver considers 
that Pi and P2 have been either lost or delayed. Then, 
the receiver acknowledges packet Pi since R(i,2) contains a 
linear combination of Pi which is considered as "seen". More 
generally, each time a repair packet is received, the receiver 
can acknowledge one of the source packets that are included 
in the linear combination. Then, the sender transmits P3 and 
P4. Just after, the sender receives an acknowledgement for 
packet Pi. So the sender creates a new repair packet starting 
from P2: R(2.A)- The receiver gets P4 and P(2..4), meaning 
that the sender has received the previous SACK packet. Then, 
the receiver sends a new SACK packet which acknowledges 
P2, P4, P5. The receiver cannot rebuild packets Pi to P3 
since he did not receive enough repair packets. As a result, 
the receiver stores R(i,2) and R[2.a) for a future use. Since 
no loss occurs after that point, upon receiving a third repair 
packet, the receiver can now rebuild the missing packets. The 
received source packets included in the linear combination are 
subtracted, which results in P(i 2),P(2 4) '^(2 6) ^^'^^ 



{R(l,2), R[2.A)t R[2..Q) ) 



G-(Pi,P2,P3)^ 



with: 



G 



R(l,2 






-R(l,2 



t(2..4) 



a 



'P2 



((2.. 4) 
-"(2. .6) 



(2) 



where ap'' is the coefficient used to encode P~ within 
the repair packet R(i..j)- 

With the assumption that G is invertible, G^^ is obtained 
thanks to a Gauss-Jordan elimination and packets Pi to P3 



are given by: 

(-Pl,^2,P3)^ = G ■ (P(i,2): ^(2..4)) ^(2..6))^ 

These packets cab be then considered as decoded. However, 
before removing them from BR, the receiver must still wait 
the reception of P(3.6..8) to be sure that the sender will not 
use these packets anymore to build new repair packets. 

This example highlights the importance of several metrics: 
the decoding delay, the buffer size at the sender and at the 
receiver, and the number of operations needed to encode 
and decode. All these metrics will be studied and analyzed 
thoroughly in the Section IV-AI 

III. On the robusteness of Tetrys parameters 

COMPARED TO FEC 

We have previously stated that an advantage of Tetrys is to 
enable full reliability with a recovery delay independent from 
the RTT. Real-time multimedia applications have different 
requirements as they support a certain number of losses while 
accepting data delivery below a given delay bound. In this 
context, this section compares Tetrys with another usual loss 
recovery scheme namely FEC block codes. In particular, this 
section emphasizes the simplicity of Tetrys configuration (con- 
trolled by a single parameter) which remains robust whatever 
the network conditions. 

FEC block codes for the erasure channel are a usual way of 
mitigating packet losses (e.g. the goal of the IETF FEC Frame- 
work groupO aims to define a generic framework between RTP 
and UDP protocols in order to use various FEC codes). In the 
remaining of this paper, the term "FEC scheme" refers to: 
source packets that are sent as soon as the application makes 
them available. Furthermore, after the emission of k source 
packets, n ~ k FEC repair packets are sent instantaneously. 
Since we want to compare Tetrys with the best FEC scheme, 
we assume that the link bandwidth is sufficiently important 
to absorbe the burst resulting from the introduction of these 
n — k repair packets. This code is assumed to be MDS (i.e 
optimal in terms of correction capability). 

Because we are considering a per-block approach, the 
recovery of lost packets is only possible at the end of the block 
when exactly k packets have been received for this block. This 
of course introduces a delay that depends on the n parameter 
chosen. 

In a real conditions, the PER is not constant over the time 
and the two key parameters of FEC schemes: namely the 
block size and code rate, should be adapted. Unfortunately, this 
adaptation requires feedback information which by definition, 
are always constrained by the RTT. Thus, the information is 
always returned at least one RTT later and might not reflect 
the current network state. As a result, the FEC parameters 
currently used might not be optimal. 

On the opposite, Tetrys repair packets are uniformly spread 
among source packets. Then, packets lost can be recovered 
without waiting for the end of a fixed length block. 

^See|http://www.ietf.org/dyn/wg/charter/fecframe-charter.htinl| 
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Fig. 3. Cumulative Distribution Functions (CDF) of packets delivery delay for Tetrys (bold curve) and FEC (multiple staircase-like curves, con'esponding to 
various block size configurations) over different Packet Loss Rate (PLR) and an RTT set to 200ms. The redundancy ratio is set either to 0.2 (i.e. the code 
rate equals to 0.8, upper row) or 0.5 (i.e. the code rate equals to 0.5, lower row). With FEC schemes, the block sizes are set to k={4; 8; 12; 16; 20; 24; 28; 
32) (resp. k=[2; 4; 6; 8; 10; 12; 14; 16; 18; 20}) for the upper (resp. lower) row. 



A. Decoding delay performance evaluation 

In order to demonstrate the benefits previously cited, we 
provide some results showing the gain obtained by Tetrys 
compare to FEC in terms of decoding delay over several 
network conditions. 

We have implemented a Tetrys prototype in C language. 
For decoding, a Gauss-Jordan matrix inversion has been de- 
veloped, inspired from Luigi Rizzo's Reed-Solomon codec 
IfTTl . This algorithm is modified in order to determine, in 
the case of a singular matrix, the repair packet which is a 
linear combination of the other received packets. This packet, 
which is useless, is then discarded and the decoder waits for 
additional repair packets. In these experiments, the coefficients 
for the linear combination are randomly chosen on the finite 
field F256, except in Section \V-G3 1 where other finite fields are 
used. 

We drive several tests to compare Tetrys to FEC schemes 
in the presence of random losses. In these tests, we set the 
redundancy ratio to either R = 0.20 (i.e. the code rate equals 
to 0.8, upper row) or i? = 0.5 (i.e. the code rate equals to 
0.5, lower row). The Packet Loss Rate (PLR) is increased 
progressively to approach the R parameter The results are 
depicted in Figure [5] For each figure, we draw one FEC 
scheme per block size {n parameter) curves. The rationale is 
to compare Tetrys to the optimal FEC scheme configuration. 
For a fixed code rate given, we see that: 

• in all the studied cases, Tetrys provides full reliability as 
the CDF values tend to one. This is not the case for the 
different FEC schemes (mainly for short-length blocks 
codes); 

• the probability for Tetrys to decode below a given delay 
is higher than most FEC schemes configuration (indeed 
for a given delay, the Tetrys curve is higher). Note that 
the only cases where a FEC scheme obtains a better 
performance recovery for a given delay than Tetrys occur 



when this FEC scheme has a low correction capability (as 
shown in Figure |3(d)| i. 

These results remains valid for most packet loss rates (e.g. in 
Figures |3(a)]|3(b)] [3(c)] with PLR between 6% to 12%). How- 
ever, as the PLR becomes closer to the redundancy ratio (e.g. 
in Figure [3(d)l with PLR = 15% compared to R = 20%), 
Tetrys recovery delay increases while FEC schemes overtake 
Tetrys. As a matter of fact and concerning Figure |3(d)[ 
it becomes obvious that Tetrys redundancy ratio should be 
dynamically adapted. Furthermore, it can be observed that 
Tetrys does not reach better performances when the PLR 
is close to R. However we must point out that the main 
objective in this context is to reduce the delay recovery and 
not necessarily to optimize the bandwidth occupancy (with a R 
close to the PLR). The algorithm allowing both such dynamic 
adaptation and the minimization of the bandwidth occupancy 
is presented later in the paper Section [Vl] 

As a resume, Tetrys exhibits the same delay and resilience 
efficiency for most PLR while being more efficient than the 
best FEC schemes. Tetrys redundancy ratio only needs to 
be adapted when the loss ratio approaches the redundancy 
ratio. As we only use a single parameter, this one-dimensional 
problem is easily addressed (Section IVlb. 

IV. Video-conferencing applications 

Video-conferencing applications have three main character- 
istics: they are known to correctly perform when the end-to- 
end delay does not exceed 100 ms (see 1231 1221 ) in order to 
preserve interactivity. They are also characterized by their vari- 
able instantaneous bit rate (VBR). Indeed, Intracoded frames 
(I-frame), because they are coded from scratch, generate more 
data then predicted coded frames (P-frames), and even more 
than bipredicted frames (B-frames). Finally, loosing an I-frame 
has, in general, a worse impact on the experienced video 
quality than loosing a P or B-frame. 
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This has several impacts. First of all, FEC schemes are 
limited by their block size which must neither be too large 
(since it would impact the end-to-end decoding delay) nor too 
small (since it would reduce the robustness in front of loss 
bursts). Using both the optimal block size and redundancy ratio 
requires an intricate adaptation mechanism. On the opposite, 
Tetrys offers, as seen in Section |III] a better compromise 
between the decoding delay and the resilience than the best 
FEC scheme. 

In the presence of VBR sources as video, this behavior 
is furthermore confirmed as FEC scheme lack adaptability 
compared to Tetrys. Indeed, recovering from a given number 
of losses means waiting for the reception of the same number 
of repair packets. With Tetrys, since two consecutive repair 
packets are spaced with k source packets, when the instanta- 
neous packet rate increases during the transmission, the time 
needed to receive additional repair packets is reduced, and 
the probability to recover losses before the deadline increases. 
With video coded data, I-frames are the ones that will benefit 
the most from the adaptability of Tetrys. Although it could be 
considered only as a side effect of the Tetrys mechanism, this 
particularity has a major impact on the end user quality as the 
I-frames have the biggest weight in the video quality measure. 

In this sense, Tetrys acts as an Unequal Erasure Protection 
(UEP) scheme such as DAUEP or PET E). 

More generally, nothing would prevent the use of UEP 
schemes embedded in Tetrys just by allocating lower code 
rates to the set of important data or by nesting sources subsets. 
Hence, in this work we do not consider any of the FEC UEP 
schemes nor the Tetrys UEP schemes and let these aspects for 
a future work. 

A. Experimental Setup 

We compare Tetrys to FEC schemes of different lengths 
while keeping the code rate constant, namely (fc,ri) ~ 
(3, 4), (6, 8), (9, 12), (12, 16). Two model of losses are applied 
to the channel: the Bernoulli and Gilbert-Elliott models. In 
the Bernoulli channel, each packet is independently dropped 
from the others following a Bernoulli law of parameter p. 
In the Gilbert-Elliott model, the packet losses are correlated 
following the first-order two-state Markov chain given Section 
IV-G2I in Figure [TO] We use the latest ITU-T's video codec 
recommendation, H.264, and the JM 15.1 H.264/AVC software 
lH]. We drive our experiments on the Foreman sequence, in 
CIF size, with a frame skip of one picture resulting in a 
frame rate of 15 fps. One I-frame is inserted every 14 P- 
frames and B-frames are not used at all. The average bitrate 
is about 384 kbps at the output of the video coder and the 
coded stream is packed into packets of 500 bytes length. 
The maximal tolerable end-to-end delay is set to 200 ms, 
all the packets received after this due time are dropped. A 
total of 150 coded frames, corresponding to 10 seconds of 
video, is used. In order to obtain representative results, each 
test is repeated 20 times (corresponding to 3000 frames and 
200 seconds of video). This setup is derived from the common 
testing conditions mentioned in 1231 . For evaluating the video 
we use the Evalvid framework described in |lO| , where the 



video quality is measured with the Peak Signal to Noise Ratio 
(PSNR) metric. 

B. Experiments results 
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Fig. 4. The PSNR of Tetrys remain significantly higher than FEC under 
various loss rate and pattern. 



As expected, the results achieved are unequivocal: Tetrys 
clearly outperforms all the tested FEC schemes in all scenarii. 
For the average performances on the Bernoulli channel plotted 
in Figures |4(a)[ Tetrys achieves a gain of 7.19 dB at a loss 
rate of 15 % over the best FEC scheme (FEC 6 8) for this 
scenario. The drop in PSNR for Tetrys does not exceed 4 dB 
when the PLR increases from 5% up to 16%; hence ensuring 
the average PSNR is always above 30 dB. When full reliability 
is impossible because of high time-constraints, Tetrys allows 
graceful degradation of the video quality. The instantaneous 
performances given in Figure |5] clearly confirms this UEP-like 
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behavior of Tetrys. The 10 sec snapshot of the instantaneous 
PSNR in the bottom graph, shows that Tetrys retrieved 9 I- 
frames out of 10, whereas FEC scheme succeeded only 5 
times. 

As explained in the introduction of this section, 1-frames has 
a higher protection because of the longer decoding windows 
they have which is in turn possible thanks to their higher bi- 
trate. The top graph of Figure |5] shows the stacked histograms 
of the instantaneous packet rate calculated over bins of 100 ms 
large. This value corresponds to the delay bound from which 
we reduce the one way journey time (RTT/2). It is clear from 
this graph that 1-frames have much higher bitrates than P- 
frames. 

Consequently, it turns out that Tetrys is providing efficient 
transparent unequal protection to video data. It is worth noting 
that Tetrys is also delay/resilience efficient and it does not 
require any extra information exchanges (about data types, 
sizes, importance, etc) from the source application while most 
of the existing unequal protection schemes do. 

The performances of the Gilbert-Elliot Channel, plotted in 
figures |4(b)] and [4(c)] keep the same tendency even if the gains 
are less important when the burst length increases (3.78 dB 
for burst length of 2 and 2.72 dB for burst length of 3). 
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Fig. 5. Packet rate and instantaneous PSNR of a live video with PLR of 
15% on the Bernoulli channel 



V. Evaluation of the Mechanism Parameters 
A. Analytical model 

We propose in this part a model allowing to assess the key 
properties of the Tetrys mechanism. We assume the packet 
losses follow a Bernoulli law of parameter p. Under this 
assumption, we introduce a Markov chain: {Yn, n > 0}, which 
represents the difference between the number of lost packets 
and the number of received repair packets observed after the 
reception of each repair packet. As in section Sec. HIl we 
assume to decode when Yj = 0. This assumption is valid if the 
finite field is chosen sufficiently large (see |]9| for theoretical 
arguments and Section [V-G3I for simulation results). 

As a first step, we focus on the probability distribution of 
{Yn,n > 0}. Then, we use this distribution to estimate the 
decoding delay, the average buffer size and the computation 
complexity of the algorithm. 



Step in {Yn, n > 0} are done with a block granularity. We 
define a block as a set of /c + 1 consecutive packets that begins 
at the first source packet sent after a repair packet and ends 
at the next repair packet. 

The reception of each packet is represented by a random 
variable (r v.) Xi,j, where i > and ^ j ^ k. With this 
notation, i corresponds to the block and j to the position of 
the packet in the block. 

On the Bernoulli channel, we have P[Xij = 1] = p 
(the packet is lost), and P[Xi,j =0] = 1 — P (the packet 
is received). The variables Xij, where ^ j ^ A: — 1 
thus corresponds to source packets and the variables Xi^k 
corresponds to the repair packets. We then define the r.v. Xi, 
where i > 0, as follows: 

fe 

X, = (3) 

j=o 

Indeed, this sum can be expressed as Xi = Sj=o -^i-j + 
{Xi^k ~ !)• Then, the loss of one of the first k (source) packet 
increments the value of Xi while the reception of the repair 
packet decrements the value of Xi. Since Xi is obtained from 
a sum of Bernoulli variables, we have P{Xi = u — 1) = 
- with u = 0, . . . , fc + 1. 

We then define the Markov chain {Yn,n ^ 0} as follows: 

y _ ( Yn-l + Xn if Yn-1 + X„ ^ 

" " \ else 

From this definition, we can obtain the following transition 
probabilities: pij := P{Yn = j\Yn-i = i). Let us denote P 
the matrix {pi.j)i.j^o and Pi"' the entries of P". 

Let R = + 1) be the repair ratio. The analysis of 

this Markov chain shows that if i? > p, then any lost packet 
is recovered in a finite time. Indeed, the chain, which is 
irreducible, is defined over the non-negative integers. If R > p, 
the expectation of Xi, denoted by E{Xi), is negative and thus, 
it can be proved that the state is positive recurrent. Thus, 
a decoding event occurs in a finite mean delay. If r = p, the 
chain becomes null recurrent. This implies that any lost packet 
is decoded but the mean decoding delay is infinite. If R < p, 
the chain is transient and there is no guarantee to decode a 
lost packet. 

Let us consider the case where R > p. Since the chain 
is irreducible and one state is positive recurrent, it admits a 
stationary distribution P{Yj ~ i) for i,j ^ 0. This distribution 
can be obtained by: 

PiYj =i)^ lim pf^ 

for any z, j ^ 0. 

B. Decoding Delay 

To study the decoding delay, we first need to obtain the 
distribution of the first hitting time. In our context, the first 
hitting time is denoted by Ti and is defined as follows: 

= {mint such that Yt = Q\Yq = i} 
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Let us define 



and 



the probability generating function (p. g. f.) of T^. Following 
||3] chap. 2, lemma 25], we have : 

F,{z) = G,{z)/Go{z) 

The probability distribution of Ti can be then obtained from 
Fi{z) by evaluating: 

Since this Markov chain concerns the decoding delay at 
the block level, we now need to refine the analysis at the 
packet level. Let us consider that a packet sent in position j 
(j = 0, . . . , /c — 1) of a block i is lost. Let Dj be its decoding 
delay. This delay has necessarily the form k — j + z{k + 1) 
because the decoding can only be performed at the reception 
of a repair packet. 

Recall that Yi-i and Yi are the r v. representing the states 
of the chain {Yn, n ^ 0} after the previous block and at the 
end of the current block. 

Since the considered packet is lost, we have: 



P{Y, = y + u\Y,^i = y) = (^)K(l-p)* 



where u = 0, 



, k. We also have: 



P{Dj^k-j + zik+l)) 
^Ey>oELoPiD,=k-j+z{k + l), 



Y, 



y,Y,=y + u) 



= Ey^o E:=o PiDj = k-j + z{k + 1)/Y = y + u) 
P{Y, =y + u/Y,^i = y)P{Y^-i = y) 

P{Y, =y + u/Y^-i = y)P{Y^-i = y) 

Since all the probabilities are known, the probability distri- 
bution of D j can be then obtained. 



C. Matrix sizes 

Like most of erasure codes, the decoding operation in Tetrys 
basically consists in inverting a matrix defined over a finite 
field. The size of this matrix corresponds to the number of 
repair packets involved in the decoding. Compared to classic 
block-based erasure codes (rateless or not), the main difference 
is that there does not exist theoretical bounds on the size of 
the matrix that must be inverted. This is due to the concept of 
elastic coding window. On the other hand, thanks to the elastic 
coding window, it can be observed that, with a good choice of 
parameters, the sizes of the inverted matrices by Tetrys is most 
of the time lower than the matrices used by classic erasure 
codes. For these reasons, the study of the sizes' distribution 
of the inverted matrices is important. 



The first step in this study is the analysis of the recurrence 
time. This parameter is the time between the first loss after 
a decoding and its recovery. This time is expressed in time 
units, where a unit time corresponds to the delay between the 
transmission of two consecutive packets. 

With the notations introduced in the previous section, if we 
consider that at least one source packet is lost in a block, 
we can define the r. v. F which corresponds to the position 
of the first lost packet in the block. We have P{F = j) = 
p{l - py/{l - (1 - p)''), for j = 0, . . . , fc - 1. When the 
first lost packet occurs in position j, its recovery delay, and 
thus the corresponding recurrence time, denoted by U, has 
the form k — j + Z{k + 1), where Z represents the number of 
complete blocks included in the recurrence time. Reciprocally, 
a recurrence time equals to k — j + z{k + 1) can only be 
observed with a first loss at position j. 

Since the considered packet is the first lost after the previous 
decoding, the value of the next Yi is necessarily in the range 
[0,fc]. Thus, we have: 

P{U = k - j + z{k + l)) 

= ELo P{U = k-j + z{k + 1), = u/F = j)P{F = j) 
It follows that: 



P{U = k - j + z{k + 1)) 

= ELo PiD, =k-j+ z{k + = u) 

P{Y, = u/F = j)P{F = j) 
= E,^o Eto PiTu = z)P{Y., = u/F = j)P{F = j) 

It can easily be shown that P{Yi = u/F = j) = 
[^'J)p"-{1 -p)''^^^"-. The probabiHty distribution of U can 
be then obtained. 

To obtain the matrix size from U, we can first observe that 
in a recurrence time equals to k — j + z{k + 1), z + 1 repair 
symbols are sent. This means that the matrix size is ranging 
from 1 to z + 1. By considering that the last repair symbol is 
necessarily received, we have: 



p{z = i\u = k-j + z{k + i)) = ( ^ ]p'-\i-py 



It follows that: 



p{z = i) = E,^, E'=d Piz = i\u = k-j + z{k + 1)) 

P(U ^k-j + z{k + 1)) 

D. Analytical model of the buffer size 

Like for the matrix sizes, the elastic coding window of 
Tetrys implies that there is no theoretical bounds on the 
number of packets stored in the buffer at the sender and 
receiver sides. The aim of this part is to evaluate these 
parameters. In this section, we consider that a packet is sent 
by the sender each time unit. 

1) At the sender side: We denote by BSt the number of 
packets stored in the buffer at time t. Basically, the buffer 
contains the packets that were not acknowledged. Let S'l 
denotes the time between the reception of the last SACK and 
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t. If we consider that a SACK is sent every s.RTT time units 
and that it is lost with probabiHty p, we have : 

E{Si)^s.RTT{l/2 + l/{l-p)) 

The factor 1/2 corresponds to the average time to wait a 
received acknowledgment and the factor 1/(1 — p) is the 
expectation of the geometrical law of parameter p representing 
the arrival of the last SACK. 

This acknowledgment brings out the information on the 
reception of the packet sent by the sender one RTT ago. Thus, 
the sender has to store the RTT.k/{k + l) source packets sent 
during this period. 

Finally, at the time t — SI — RTT, some source packets 
were not acknowledged because they were lost. Thanks to the 
use of the ack-when-seen mechanism (included in the SACK 
mechanism), each received repair packet acknowledges a lost 
source packet. Thus, the number of not acknowledged source 
packets is the difference between the number of lost source 
packets and the number of received repair packets, which is 
represented by the r. v. Yn studied in Section IV-BI 

The average number of packets stored in the buffer is thus: 

E{BSt) = RTT{k/{k + l))(.s/2 + s/(l - p)) + E{Y^) 

Since the RTT does not impact on the value of E{Yn), we 
can observe that the number of packets in the buffer is thus 
linear and as a function of the SACK frequency. 

2) At the receiver side: The receiver has two buffers: the 
source buffer, which contains the received source packets 
necessary for future decoding and the repair buffer, which 
contains the received repair packets not yet decoded. The 
number of packets in the source buffer at the time t is denoted 
BRSt and the number of packets in the repair buffer is denoted 
BRRf. 

We recall that, when a source packet is received by the 
receiver, it is acknowledged in the future SACKs. When the 
sender received the first of these SACKs, it deletes this source 
packet in its buffer and does not include it in the generation 
of the next repair packets. The receiver can delete this source 
packet as soon as it received a repair packet which does not 
include this source packet in its linear combination. 

As shown in figure |6] it follows that the source packet is 
stored in the buffer during S2 + S3 + RTT, where S'2 + 
RTT/ 2 is the time needed by the sender to receive the first 
acknowledgment and 5*3 + RTT/ 2 is the time needed by the 
sender to receive the next repair packet. 

Clearly, S'2 follows the same law than 5*1. For 5*3, the same 
method can be used to estimate the mean, excepted that a 
repair packet is sent each k + 1 time units (instead of s.RTT 
for the SACKs). 

The average time spent by a source packet in the buffer is 
then: 

E{S2+S3+RTT) = RTT+{k+l+s.RTT){l/2+l/{l-p)) 

To obtain the number of packets stored in the buffer at a 
given time, we must consider that some of these packets are 
lost. Thus we have: 




Fig. 6. Receiver buffer 



E{BRSt) = (fc/(fc + 1))(1 - p)E{RTT + S2 + 53) 

= (fc/(fc + 1))(1 - p)RTT + (fc + 1 + s.RTT) 

((i-rt/2 + 1) 

To estimate the number of repair packets in the repair 
buffer, we can first estimate the probability of having no repair 
packet in the buffer. This probability is equal to P{Yn = 0) 
determined in Section IV-BI 

When there is at least one packet in the repair buffer, we 
can consider the probability distribution of the recurrence time 
U . Indeed, for U = k — j + z{k + 1), z repair packets are sent 
and we can estimate that, on average, (1 ~ p)z repair packets 
are received. It follows that the average number of packets in 
the buffer during this period is (1 — p)z/2. We then have: 

E^/DDD ^ i:,>o(i-p)^i:';o('=-j+^('=+i))^(t^='=-j+^(*=+i)) 

E[BRRt) - 2.P(F„^o) 

Following this model, we can assess the minimum buffer 
size requested by Tetrys. In addition, source-based algorithms 
can also be envisaged to prevent buffer overflow. 

E. Empirical evaluation of the buffer size 

In order to give an insight of the Tetrys requirements in 
a typical case, we evaluate the receiver buffer evolution with 
our Tetrys implementation. We report only experiments over 
a Bernoulli channel for the receiver's buffer as the receiver's 
buffer occupancy is always bigger than the sender. The RTT, 
repair ratio and sending rate are respectively set to 200ms, 
(3/4) and 100 packets per seconds. The two parameters that 
might affect the requested buffer sizes are the acknowledgment 
frequency (as presented Section and the PLR. We studied 
in Fig |7(a)| the impact of the acknowledgment frequency on 
the requested buffer size. Experiments are done with a fixed 
loss rate (10%). For the sake of completeness, we show the 
minimum, maximum and the (5, 10, 25, 50, 75, 90, 95) 
percentiles (the 50 percentile is the buffer size of the 50% 
highest buffer sizes) of the number of packets in buffer during 
the experiment. The samples used to compute these percentiles 
are selected at the reception of each data or repair packets. 

We can see that with one acknowledgment sent per packet, 
one per RTT and one for two RTT the 50th percentile are 

^The results ai'e in the same order of magnitude under bursty losses 
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Fig. 7. Minimum, maximum and (5, 10, 25, 50, 75, 90, 95) percentiles of 
the number of packets requested to decode with a 3/4 repair ratio for Tetrys 



respectively around 20, 30 and 40 packets. The points in 
Figure |7(a)| also give the mean value which overlaps the 
50th percentile. This confirms that as E{BRSt) suggests, the 
average number of packets kept in the buffer evolves linearly 
with the acknowledgment frequency. 

The other parameter of interest is the PLR, since we have 
seen that when its value is closed to the repair ratio, the 
recurrence time increases. Fig |7(b)| presents the result with 
an acknowledgment frequency of 1 and shows the number of 
packets in the buffer for a PLR varying from 1% to 20%. 
We can see that the (5, 10, 25, 75, 90, 95) percentiles remain 
close to their 50 percentile, implying a low number of packets 
in the buffer (most of the time around 30 ^ 40 for one 
acknowledgement per RTT) and a reasonable peak size (a 
maximum of 160 packets) the rest of the time. 



F. Complexity analysis 

These complexities are expressed in number of operations 
performed on packets. For example, the multiplication of a 
packet by a finite field coefficient or the addition (i.e. the XOR) 
of two packets are considered as one operation. 

1) Encoding Complexity: This complexity corresponds to 
the number of operations needed to generate one repair packet. 
Following the main concept of Tetrys, the number of source 
packets included involved the linear combination is the number 
of packets not acknowledged, i.e. the number of source packets 
in the buffer of the sender This quantity, denoted by BS, is 
studied in Section [V-DI 



2) Decoding Complexity: For each received repair packet, 
the decoder first subtracts the source packets of its source 
buffer, i.e. BRSt- This quantity is also studied in Section [V-DI 

For the decoding operation, the decoder has to invert a 
matrix of size Z (studied in Section [V-Cb . The inversion of a 
general matrix has a cubic complexity (in terms of operations 
on the entries of the matrix). It follows that the number 
of operations performed for each repair packet is of order 
0{BRSt + Z). 

G. Simulation results and analysis 
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1} Impact of the PLR: Figure [8] illustrates the variations of 
the mechanism performances in terms of average decoding 
delay, mean matrix size and mean recurrence time on a 
Bernoulli channel in the case where the repair ratio is fixed to 
0.25 and the PLR p. 

The error probability is represented in the x-axis and the 
matrix size, mean decoding time and recurrence time are 
represented in the y-axis. We used two scales in the y-axis. 
The first one (on the left side) is expressed in number of packet 
and is used for the mean matrix size. The second scale (on 
the right side) is expressed in time units. Recall that a unit 
time corresponds to the delay between the transmission of two 
consecutive packets. This time scale is used for mean decoding 
time and the recurrence time. 

The first observation is that the three curves increase with 
the PLR. This can be explained by the fact that when the 
error probability is very small compared to the repair ratio, 
then the decoding is done quickly and thus, the recurrence 
time, the decoding delay and the size of the inverted matrices 
are small. When the PLR grows up to the repair ratio, the three 
curves increase. It can be recalled from the previous Section 
that the average recurrence time is equal to \/{R — p) and 
thus, is infinite when R = p. 

We can also observe that the "decoding delay" curve be- 
comes larger than the "recurrence time" curve. This can be 
explained by the fact that the decoding delay is related to 
packet while the recurrence time is related to decoding. In the 
case of a large "recurrence walk", a large number of packets 
have a large decoding delay, and thus this walk has a larger 
influence on the average decoding time than on the average 
recurrence time. 
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2) Impact of the burst size: Figure |9] shows the influence 
of the burst of losses. We consider the weU-known first- 
order two-state channel model of Figure [TO] The input loss 
probabilities Pi and P2 vary in such a way that the mean 
PLR is kept constant (equals to 0.2). The repair ratio is fixed 
to 0.25. 




Fig. 10. The first-order two-state Markov chain representing the Gilbert- 
Elliott channel model 

The parameter Pi is represented on the x-axis. The value 
of P2 can be deduced from the PLR p and Pi . Indeed, it can 
be easily shown that the mean PLR of this model is equal to 
p = Pi/(1 + Pi — P2) and that the mean burst length is equal 
to 1/(1 - P2). Thus, P2 = l + Pl- Pi/p. 

Compared to Figure |8] the curve representing the mean burst 
length (equal to 1/(1 — P2)) is added. We can observe that 
a small value of Pi implies a large value of P2 and thus a 
large mean burst size. On the opposite, when Pi = P2, the 
Markov channel becomes a Bernoulli channel of parameter Pi 
and thus, the mean burst size reaches its minimum. 

The main information of the Figure|9]is that the burst losses 
have a negative impact of the matrix size, mean decoding time 
and recurrence time. We can observe that when Pi varies from 
0.1 to 0.2, the burst size varies from 2.5 to 1.25. In this range, 
the matrix size, mean decoding time and recurrence time are 
also divided by 2. 

Even this rate of 2 is very specific to this simple example, 
more generally, we can observe that the only consequence of 
bursts is the increase of the decoding delay, recurrence time 
and of the matrix size at the decoder side. Indeed, the property 
to decode all packets if R > p remains true. 

Note that in the case of channels with variable parameters 
(with a fixed PLR), our mechanism adapts naturally to the 
variable channel without external intervention. 

3) Impact of the finite field size: Figure [TT] shows the 
influence of the finite field size on the outputs. Indeed, it 



was shown in Section that the decoding is not necessarily 
possible as soon as the number of received repair packets 
is equal to the number of lost source packet. This can be 
explained by the fact that the corresponding matrix is not 
singular. In this case, the receiver must wait additional repair 
packets and then the delay and the matrix size are increased. 
In this simulation, the finite field size (on the x-axis) varies 
from 2^ to 2*. Recall that the coefficients used to build the 
repair packets are randomly chosen. 

The main result of this figure is that the two smallest finite 
fields (F2 and F4) obtain performances significantly worst than 
the others finite fields. Even if the binary field is attractive 
because all its operations can be implemented with extremely 
fast XORs operations, this field should be avoided in our 
mechanism. The best compromise seems to be the field Fg 
which obtain excellent decoding performance while supporting 
very fast operations. 

Following [41, a multiplication in the field ¥2^ (here m ~ 
3), can be implemented on average with m? /2 XOR operations 
(here 9/2). 

VI. Redundancy allocation under reliability and 

LATENCY CONSTRAINTS 

As for the video conference example, rather than full 
reliability, some multimedia applications require that a given 
proportion Pktmin of packets arrive within a tolerable delay 
Dmax (e-g- VoIP applications). After this delay, packets are 
considered as lost by the application although they might be 
delayed in the network and arrive later 

In order to verify whether the request given by an appli- 
cation defined by {Pktmin, Dmax) is feasible, we choose to 
infer a Tetrys heuristic model 9 following several experiments. 
We define this model as follows: 

m (d,p,b,T,FI.) (4) 

This model gives the cumulative distribution function of the 
lost packets recovery delay where R is the redundancy ratio 
for an application that produces a packet every T second^ 
according to the network characteristics (i.e. a delay d, a PLR 
p and a burstiness of losses b). 

''We assume a Constant Bit Rate (CBR) where all the packets have the 
same size. 
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We then test the capability of Tetrys to satisfy the request 
{Dmax, Pktmin), giveii R, with a boolean function denoted 
'i^g(^t){Dmax, Pktjnin)- ^ rctums TRUE if the probability 
that a packet arrives before Dmax is higher than Pktmin 
and FALSE otherwise. As a result, by iterating R (starting 
from R = p), we find the set of solutions that satisifies 
the application requirements. Finally, among these possible 
solutions Tetrys solves (|5]l to find the smallest redundancy 
ratio needed denoted Rmin- 



Scale 



Shape 



(5) 



In the following we present the method used to build this 
model. 



A. Model of the delay distribution 

The behaviour of the Tetrys mechanism can be modeled by a 
Markov chain process with a random walk driven by the losses 
of source packets and the reception of redundancy packets. 
Following Section IV-BI we could compute the recurrence and 
hitting times of the Markov chain and obtain an analytical 
model of 9. Unfortunately, the computation complexity of this 
model requires substantive computation time that prevents any 
implementation inside a real protocol. This motivates the use 
of our heuristic model 6 previously introduced. 

1) Experiments: We have performed several experiments 
with: 

1) a redundancy ratio R ranging from 0.1 to 0.5; 

2) a PLR p ranging from 1% to 50% which follows either a 
Bernoulli law or a Gilbert Elliot law with average burst 
size of 2 and 3; 

3) 10^ source data packets generated by each experiments. 

2) Distributions fitting: We seek to estimate the delay 
in number of packets sent (and supposed to be received) 
between a lost packet and the redundancy packet that rebuild it. 
Following the distribution of packets recovery delay obtained 
by the experiments, we find out that the Weibull law fits our 
distributior(§. 

A Weibull distribution is defined by two parameters: the 
scale and the shape. Such distribution captures both exponen- 
tial distribution if the shape parameter k is around 1 and the 
heavy tailed distribution if k < 1 and is defined as follows: 
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Fig. 12. Evolution of the scale (A) and (k) shape as a function of 

In the same way, the scale parameter is only impacted by 
n and the losses distribution. The scale can be approximated 
by: 

A(A«) - (7) 

with a and b some parameters related to the loss pattern and 
n. 



It results that 6 can be approximated by: 

X — e ^ ^(".p.<:) ' 



(8) 



with: 



X{n,p,c) = jrd^^, 

K{n,p,c) = ac,„ * (i -p) + bc,n, 

c the channel € ber, 62, &3, Qc^n and bc,n the appropriate 
values in the table |II] and Hill 
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Fig. 13. Comparison between the empirical distribution obtain by experi- 
ments and e [d, p, fe, T, R] ; T = 10ms, n=3 



B. Estimating the distribution parameters 

For a given loss distribution (e.g. Bernoulli or Gilbert Elliot) 
the delay distribution is impacted by n (fc + 1) and p (as 
A/? = ^ — p))- For each value of the block size n and 
each loss distribution the shape parameters evolves "linearly" 
as a function of as seen in Fig. [12] The linear function 
coefficients obtained through a least square are stored in table 

m 

'We used R |16| statistical software environment 



Figure[T3]presents the good fitting obtained by the empirical 
distribution of the delay obtained by experimentation and the 
expected distribution obtained with Q. The results are shown 
for a PLR of 15% and 20% with 6=1 (i.e. a Bernoulli erasure 
channel) and a PLR with 6 = 3 (a Gilbert Elliot losses with 
an average burst size of 3). 

C. Accuracy 

Figure [14] shows the results of the accuracy of Rmin (see 
((5]l)in a practical use case. The application emits at 100 pkt/s 
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Fig. 14. Comparison between the optimal (i.e. = Rrnin) and suboptimal 
(i.e. ^ Rmin) redundancy ratio. The metric is the probability for a packet to 
be received or decoded before the 300ms requested delay 

and request a minimum of (Pktmin, Dmax) — (0.95, SOOms). 
The losses in this example are uniformly distributed and 
the one-way delay is fixed to lOOms. The figure gives the 
probability for a packet to be received or decoded before 
a delay D„iax = SOOtos. When using Rmin = ^ (plotted 
with the strong full line) we can see for the various PLR, 
P[X < 300] is always higher than 0.95 which means that in 
this case, Tetrys satisfies the requirement of the application. 
When using Tetrys with R = (plotted with the dashed 
line) (e.g. "the highest smaller redundancy ratio") the results 
are mostly below 0.95. 

VII. Conclusion 

In this paper we propose a novel reliability mechanism, 
Tetrys, based on on-the-fly erasure coding techniques. We 
demonstrate, through a detailed modeling of Tetrys perfor- 
mance as well as real measurements, that Tetrys can achieve 
a full reliability service even in case of an unreliable ac- 
knowledgment path (thanks to the non sensitivity of Tetrys 
to the loss of acknowledgments), or as the extreme case no 
acknowledgment at all, while ensuring faster data delivery to 
the application than pure FEC based techniques. In particular, 
we demonstrate that Tetrys offers key benefits when used in 
the context of video-conferencing (and more generally real- 
time applications) over best effort networks. In this case, the 
main challenge tackled by Tetrys is to combat loss and delay 
in order to bring a substantial gain in terms of end user 
perceived quality. We show that Tetrys allows a faster recovery 
of missing information compared to block codes, and at the 
same time avoids non-useful retransmitted packets. Although 
the contributions of this paper deal with real-time data flows, 
Tetrys can also be used with non real-time applications, or 
at a different protocol layers. We expect to investigate these 
considerations, as well as the interactions between Tetrys and 
a congestion control mechanism, in a future work. 
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